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Abstract: Initial state radiation, multiple interactions, and event pileup can contaminate 
jets and degrade event reconstruction. Here we introduce a procedure, jet trimming, de- 
signed to mitigate these sources of contamination in jets initiated by light partons. This 
CN procedure is complimentary to existing methods developed for boosted heavy particles. 

We find that jet trimming can achieve significant improvements in event reconstruction, 
especially at high energy/luminosity hadron colliders like the LHC. 
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1. 


Introduction 





Jets are collections of hadronic four-momenta used to approximate the kinematics of short 
distance scattering events. Since the high-energy frontier is explored by hadron colliders 
with color-rich final states, jets are a necessary tool to better understand the physics of the 
standard model and probe whatever lies beyond it. To assemble jets one must make use 
of jet algorithms — well-defined procedures for collecting detector tracks and calorimeter 
cells into jet four-momenta. Many such algorithms exist, with each exhibiting a different 
clustering behavior.^ Though the choice of jet algorithm introduces some level of ambiguity 
in any jet-based measurement, this is still acceptable, as any infrared/collinear-safe jet 
algorithm will yield results that can be compared to theoretical calculations. 

In general, the optimal jet algorithm for an analysis is the one which most closely 
reconstructs the hard scattering process. The closer the reconstruction is to the true 
scattering, the greater the signal significance.^ Now, if the final states observed in a detector 
only arose from the products of a hard scattering, and if the jets were well-separated from 
each other and from the beamline, then the precise jet definition used would not matter 
very much. In that idealized scenario, the jets would be accurately reconstructed by any 

^For comprehensive reviews and relevant references see Refs. [1, 2]. 

^In principle, the choice of jet algorithm could also help control reducible backgrounds. 
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jet algorithm, as long as the algorithm clustered most of the hadrons arising from final 
state radiation (FSR). 

In reality, however, a detector records more than just the final states from a hard 
scattering event. The incoming states will typically radiate before scattering, leading to 
copious initial state radiation (ISR). In addition, multiple parton interactions (MI) and 
event pileup will further contaminate the final state. ^ This is an especially prominent 
effect at the Large Hadron Collider (LHC) because of its high energy and luminosity. The 
net effect is that hadrons from ISR/MI/pileup are spatially overlapped with hadrons from 
FSR, complicating the jet finding procedure. Thus, there is an inevitable tradeoff. On 
the one hand, we would like a jet algorithm to form jets large enough to cluster all of the 
hard scattering decay products and account for wide angle FSR emissions. On the other 
hand, we are constrained in how large our jets can become by inevitable contamination 
from hadrons unassociated with the hard scattering. 

This conflict between missing radiation and contamination is usually resolved through 
a judicious choice of the jet size parameter (usually the jet radius R). One can either fix 
the jet radius at an optimal value, or employ an algorithm designed to choose the optimal 
size on a jet-by-jet basis (e.g. the VR algorithm [3]). It is possible to go a step further and 
statistically account for the sources of contamination by assuming a diffuse distribution 
and subtracting off a fixed contribution to each jet proportional to its area [4] . However, 
one can take a more aggressive approach by actively working to identify and remove the 
radiation contaminating each jet. The basic idea behind such an approach stems from the 
observation that there is usually only one hard scattering per event; all other sources of 
radiation (ISR/MI/pileup) are likely to be much softer. By going inside a jet and removing 
soft radiation (through a modification of the sequential clustering procedure or through 
the use of subjets), reconstruction can be improved. 

This idea of hierarchical radiation and its potential use in cleaning up contaminated 
jets has gained acceptance in the jet community. In the past, most studies focused on 
boosted hadronically decaying particles like the W/Z [5, 6], Higgs [7, 8], and top [9, 10, 
11, 12],^ where the procedure is optimized toward improving the jet mass resolution. The 
only mention that we are aware of for using such a technique outside of heavy object 
reconstruction is Ref. [15], in which it was observed that applying the same procedure 
useful in reconstructing a boosted Higgs could also help reconstruct jets from light partons. 

In this paper, we present procedures specifically designed to improve the reconstruction 
of ordinary QCD jets arising from the showering and fragmentation of nearly massless 
partons (i.e. light quarks and gluons). To distinguish this from prior work on boosted 
heavy particles (such as jet filtering [7] and jet pruning [11]), we will call our procedures jet 
trimming. In the next section, we will further discuss the contamination of jets and try to 
quantify its effects. In Sec. 3, we will introduce jet trimming algorithms and discuss different 
versions of these applicable to final states in various kinematical regimes. In Sec. 4, we will 

hard scattering event takes place between the partons of two coUiding hadrons. Further interactions 
between those hadrons are called multiple interactions, while interactions between other hadrons in the 
colliding bunches are called pileup. 

^See Refs. [13, 14] for some examples in supersymmetric processes. 
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present the results of our algorithms and compare them both with the untrimmed results 
and with earlier cleaning techniques. We will see that by using algorithms specifically 
designed for light parton jets we can achieve a substantial gain, beyond the improvements 
seen through applying the techniques developed for boosted heavy particles. Sec. 5 contains 
our conclusions. 

2. Trimming QCD Jets 

As discussed in the introduction, jet reconstruction always presents a trade off between 
capturing all of the radiation associated with a hard scattering while at the same time 
minimizing the contamination from other hadrons present in an event. Before we discuss 
this, let us first introduce some notation and provide some details about our study. 

Throughout this paper, we will refer the typical size of a jet in terms of its char- 
acteristic radius R using distances defined on the (rapidity y, azimuth (f>) plane: Ai? = 
(A?/)2 + (A(/))2 . When referring to generic fixed-radius jets and their size (Rq), we are 
implicitly using the anti-kT algorithm [16] for jet reconstruction, as this reasonably approx- 
imates the behavior of an ideal cone algorithm (for a discussion on the behavior of other 
algorithms in reconstruction see Ref. [15]). To generate our Monte Carlo events samples we 
use Pythia 6.4.21 [17] with the default 'Tune-A' [18, 19] settings and assume a 14 TeV 
LHC. Our jets are clustered using Fast Jet 2.4.0 [20, 21]. While the discussion here in 
Sec. 2 will not account for the effects of pileup (so as to demonstrate the irreducible, signif- 
icant effects of ISR/MI contamination), we will factor in the effects of pileup for our results 
in Sec. 4, assuming a relatively modest luminosity per bunch-bunch crossing of 0.05 mb~^. 
To approximate the effects of a real detector, we always group final state partons/hadrons 
into 5?7X(5(/) = 0.1x0.1 calorimeter cells between — 3 < < 3, and assign the cells massless 
four-momenta based on the calorimeter energy. 

Finally, we note that while most aspects of particle collisions calculated in Monte 
Carlo programs rest on firm bases from fundamental physics, the effects of hadronizaton 
are only understood through phenomenological models.^ This might seem to be cause for 
concern, as our results will to some extent reflect the effects of hadronization, but we expect 
these dependencies to be small, altering perturbatively calculated jet/subjet momenta by 
O(Aqcd)- While we will operate under this assumption for the rest of the article, the 
validation of hadronization models will be an important task at the LHC. 

2.1 The Effects of Contamination 

In absence of ISR/MI contamination, a large R is desirable in the context of traditional 
jet clustering. To see why, consider the process gg ^ (j) ^ gg where (/> is a new color 
octet scalar with a mass of 500 GeV and a narrow width. ^ In a showering Monte Carlo 
program without hadronization, FSR is factorized from ISR/MI, so one can study the FSR 

^Hadronizaton is modeled in Pythia using the Lund model [22] , which has been successful in reproducing 
collider data [23]. 

®The cj> couples to gluons via the operator Tv{<j)G^ii,G^"). For comparison, we will also consider a different 
color octet scalar (f> that couples to fermions via a Yukawa coupling q(j>q. 
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Figure 1: Reconstructed = 500 GeV from gg ^ (p ^ gg dijet events with FSR only (left) and 
with the addition of ISR/MI (right). In the absence of ISR/MI larger jet radii are preferred, while 
when ISR/MI are turned on a smaller radii must be used to balance the effects of contamination. 



in isolation. On the left side of Fig. 1, we show the distribution of the reconstructed (j) 
mass using only FSR for various values of the anti-ZcT jet radius Rq. One sees that as Rq 
increases, the reconstructed invariant mass distribution approaches the narrowly peaked 
distribution predicted from the hard scattering. 

However, when one includes the effect of contamination, larger values of Rq can yield 
poorer reconstruction, as seen from the right side of Fig. 1. Here, the jet radius that most 
closely matches the desired peak position is around -Rmax = 1-1) considerably smaller than 
what one would want to use considering FSR alone. From this one can see that an optimal 
jet algorithm would be one with a large overall jet radius that somehow avoids clustering 
in hadrons from ISR/MI (as well as pileup). 

Now, there is always a minimum spatial overlap between FSR and ISR/MI from the 
fact that the two sources of hadrons could end up nearby in the detector. Fortunately, this 
overlap is relatively small. In Fig. 2 we present the (p mass reconstructed using Rq = 1.5 
where only those calorimeter cells within AR = 0.2 ^ of one containing at least 1 GeV of 
FSR were clustered, along with the distribution obtained without this restriction. The re- 
stricted distribution is quite close to the one where only FSR was clustered, confirming the 
minimum spatial overlap. By considering this sort of restriction to FSR-heavy cells, one 
can calculate the maximum possible reconstruction improvement in going from ordinary 
cones to such an idealized jet algorithm. This is shown in Table 1, where the improve- 
ment is measured by the reconstruction measure A presented in Sec. 4. We see potential 
improvements of up to 3x in reconstruction. Of course, such an idealized jet algorithm 
cannot exist since no physical observable can distinguish between FSR and ISR/MI, but 
the room for improvement is compelling. 

''With hadronization turned on, there are non-trivial correlations between FSR and ISR. 
*While at this point the choice of AR — 0.2 is somewhat arbitrary, later in Sec. 4 we will see that this 
is a reasonable subjet radius for use in trimming. 
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Figure 2: Reconstructed = 500 GeV clustered with Rq = 1.5 for gg ^ cj) gg (left) and 
qq ^ (j) ^ qq (right). The blue curve shows the reconstruction from a sample without ISR/Ml. The 
red and black curves show data from a sample with ISR/MI, where all cells are clustered (red), and 
where only those cells within AR = 0.2 of an cell containing more than 1 GeV of FSR (black). The 
similarity between the black and blue peaks demonstrates that considerable gains in reconstruction 
are possible despite the irreducible overlap in radiation. 





Improvement 


Ro 


r [GeV] 


M [GeV] 




gg ^ (p^ gg 


All cells 
FSR cells 


309% 


1.2 
1.5 


69 
15 


518 
501 




qq^ (p^ qq 


All cells 
FSR cells 


189% 


0.8 
1.5 


31 
11 


505 
501 



Table 1: Improvement in the resonance reconstruction measure A presented in Sec. 4 in going 
from standard clustering {All cells) to an idealized situation where we only cluster those cells within 
AR = 0.2 of an ceh containing more than 1 GeV of FSR {FSR cells). Here = 500 GeV. The 
definitions of F and M appear in Eq. (4.1). Because of the larger color charge of gluons compared 
to quarks, there is more radiation in the — > — >■ gg case compared to the qq ^ (f> ^ qq case, so 
the potential improvement is correspondingly larger. 



The goal our jet trimming algorithm is to approach this ideal reconstruction as closely 
as possible. To do so, we need some kind of criteria to determine whether a given patch of 
the calorimeter is likely to contain substantial amounts of FSR. In light of the observation 
that ISR/MI (as well as pileup) is usually soft compared to FSR, the simplest possible 
criteria we have is relative transverse momentum. As shown in Fig. 3, in a typical jet 
ISR/MI makes up only 0(1 — 5%) of the jet's pT (the contribution of pileup is a luminosity 
dependent question), and we saw earlier that there is minimal spatial overlap between 
contamination and FSR. Therefore, sources of contamination can be mitigated by simply 
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Figure 3: Fraction of a jet's pr attributable to ISR/MI for gg ^ (f> ^ gg (left) and qq 
(right). 
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removing patches of soft calorimeter cells. 
2.2 QCD Jets vs. Boosted Objects 

While the general idea of removing soft calorimeter cells is straightforward, a number of 
details remain unspecified. At minimum, one wants to consider patches of calorimeter cells 
by clustering them into subjets of radius i?sub > Sca.i = 0.1 to remove any sensitivity of the 
procedure to calorimeter segmentation. Beyond that, one must specify how the subjets 
are to be formed, how large they will be, and what will serve as the criterion for softness. 
As we will argue, by choosing jet trimming parameters in a way designed to enhance the 
reconstruction of light parton jets, we can increase reconstruction performance beyond the 
current techniques designed for boosted heavy particles [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]. 

To see how one might go about choosing trimming parameters, consider first how they 
would be chosen to reconstruct the jet from a boosted heavy particle. Usually such a 
particle decays immediately into two (e.g. the Higgs or W/Z) or three (e.g. the top) final 
states, each at the same characteristic pT scale (barring a matrix element conspiracy). 
These states will shower into distinct hard patches in the jet (see the left panel of Fig. 4), 
so one can hope to remove contamination from the system by simply associating a subjet 
to each hard final state and discarding everything else. That is, one would discard all 
but the A'^cut hardest subjets. Whether or not a particular subjet from a boosted heavy 
particle is considered soft depends upon where the subjet ranks in the subjet pT ordering 
and upon how many final state partons we expect in the decay. For instance, if we are 
looking to reconstruct a Higgs in its decay /i — ?• 66 we would form subjets inside the initial 
jet and discard all but the hardest two.^ In this context, the natural size of a subjet is also 
relatively clear; to treat each final state of the decay equally (as we should, since they have 



^Sometimes analyses allow for one subjet beyond the minimum number of tree level final states in order 
to capture the first emission [7], but the principle remains the same. 
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Figure 4: Comparison of a jet formed from the decay of a boosted heavy particle (left) with one 
from the showering of light flavor/gluons (right). Specifically, the left hand panel shows the jet 
formed from h bb while the right is a giuon jet. The {x, 2/)-axes are (y, (/))-distances as measured 
from the jet center and the area of each calorimeter cell is proportional to its px- 



comparable pts) we are limited to i?sub i$ ^o/2 under the assumption that the initial jet 
was chosen to be just large enough to encompass the entire decay of the heavy particle. 

The situation changes when we consider jets from light quarks or gluons (compare 
the two panels in Fig. 4). The first difference is that there is only one hard final state at 
lowest order in a^. Softness is therefore more naturally established directly via a cut on 
subjet pt rather than by restricting to a fixed number of subjets. Later we will establish 
different subjet pT cuts for different kinematic regimes. The second difference is that there 
is no natural size for the subjets as this depends upon the the px cut for the subjets; a 
larger/smaller subjet size will necessitate a harder/softer subjet pT cut. With these two 
differences in mind, we can now define our jet trimming procedure. 



3. Implementation 

In this section, we present an explicit algorithm implementing the jet trimming technique 
outlined above. Our choice of algorithm is motivated primarily by simplicity and the 
ability to re-use existing jet finding procedures. Many more sophisticated choices could 
easily be imagined, but these are beyond the scope of the present work. 

Since our jet trimming procedure will make use of well-known sequential recombination 
jet algorithms, we will briefly review how these work. Recall that in a recursive jet algorithm 
one begins with an initial set of four-momenta (these could be tracks, calorimeter cells, etc.), 
assigning every pair a "jet-jet distance measure" dij and every individual four-momenta a 



^'^Our implementation is available as a plug-in to the Fast Jet package [20, 21], which is available from 
the authors upon request. 
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"jet-beam distance measure" diB- The distance measures relevant for our study are: 

diB = -Q-) (3.1) 

(3.2) 

diB = PTi (3.3) 
d,B = ^. (3.4) 

PTi 

At each step in the clustering, the smallest entry in the set of all dij and dis is identified. 
When a jet-jet distance is the smallest, the corresponding four-momenta are merged, while 
if a jet-beam distance is the smallest, then the associated four- momentum is "merged with 
the beam" and set aside. Here we will deal entirely with inclusive algorithms, where the 
recursion continues until all jets are merged with the beam, and the algorithm returns 
those merged jets whose px is greater than some minimum value. 

3.1 Jet Trimming 

The jet trimming procedure we advocate is an "outside-in" algorithm, meaning that a seed 
jet determined through one jet finding method is reclustered using a subjet finding method. 
Then a softness criteria is applied to the individual subjets to determine the final trimmed 
jet. One could also imagine an "inside-out" algorithm, where small subjets are found first, 
and clustering into a larger jet, again using some kind of softness criteria, but we will not 
explore that option here. 

The proposed algorithm proceeds as follows: 

1. Cluster all cells/tracks into jets using any clustering algorithm. The resulting jets 
are called the seed jets. 

2. Within each seed jet, recluster the constituents using a (possibly different) jet al- 
gorithm into subjets with a characteristic radius -Rgub smaller than that of the seed 
jet. 

3. Consider each subjet, and discard the contributions of subjet i to the associated seed 
jet if PTi < /cut • Ahard) where /cut is a fixed dimensionless parameter, and A^ard is 
some hard scale chosen depending upon the kinematics of the event. 

4. Assemble the remaining subjets into the trimmed jet. 

^^For jet algorithm aficionados, we use "VR" to refer to the "AKT-VR" algorithm of Ref. [3]. 
^^In an exclusive algorithm, the recursion stops when a predetermined distance measure dcut is reached, 
at which point the unmerged jets are returned. 
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Figure 5: Step by step illustration of the jet trimming procedure. Proceeding from left to right, 
top to bottom, we show a jet as initially clustered (using anti-fcr with Rq — 1.5), the constituent 
kx subjets with i?sub = 0.2, the subjets surviving the pTi < /cut ■ Pt cut (where /cut — 0.03), and 
the final trimmed jet. To make the figure easier to read, the area of each cell is proportional to the 
log of the cell's px- 

This procedure is illustrated in Figs. 5 and 6. The dimensionless parameter /cut quanti- 
fies the expected px scale hierarchy between FSR and ISR/MI/pileup. In principle, this 
procedure could be iterated such that subjets that fail the softness criteria in one seed jet 
could be tested for inclusion in a different seed jet. However, this is only relevant if the 
original jets were effectively overlapping, or if the removal of subjets substantially changes 
the position of the trimmed jets relative to the original seed jets. 

The precise jet definition used in step 1 is largely irrelevant for the jet trimming 
procedure. In Sec. 4, we will trim two different jet algorithms, anti-fer [16] and VR [3], 
finding improvements in reconstruction with both. 

The jet definition used in step 2, however, is more important as it determines how the 
subjets are found. We use the kx algorithm [26, 27] rather than a Cambridge-Aachen [24, 
25] or anti-Zcr algorithm [16], because subjets formed by the k^ algorithm tend to better 
share the energy between subjets. That is, imagine that the dominant FSR depositions in 
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Figure 6: The jet of Fig. 5 before (top left) and after (top right) trimming using a hnear scale 
where a cell's area is proportional to its pr- Also shown in the lower panel is the catchment area of 
the jet [4] , where the empty black squares indicate cells that would have been clustered in the final 
trimmed jet if all cells were given an infinitesimal amount of radiation. As we will discuss more in 
Sec. 4.4, the jet's area has been dramatically reduced, here to around 8% of its untrimmed value. 



a seed jet cannot be contained within a single subjet of size -Rgub- In such circumstances 
the anti-/cT algorithm, which clusters radiation from hardest to softest, will tend to create 
imbalanced subjets by allocating most of the energy to one subjet, making it more likely 
that the weaker subjet will be discarded when the softness criteria is applied. As the kx 
algorithm clusters from softest to hardest, it is more likely to yield a equitable distribution 
of energy between the subjet that contain FSR, making them less likely to be discarded by 
the trimming procedure. 

Finally, we must select a Ahard to set our criterion of hardness when judging a subjet 's 
Pt- This is a non-trivial choice, as different kinematical configurations call for different 
scales, and the difference in reconstruction from different scale choices can be large. To 
illustrate this, in Sec. 4 we will present two possibilities for Ahard : the seed jet's pT, and 
the effective mass of the event (i.e. the scalar sum of the transverse momenta: H = YIpt)- 
While we have only considered the simplest two scale choices, it would be interesting to 
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investigate more complicated methods to see if additional gains could be realized. 
3.2 Comparison to Previous Methods 

As argued before, most techniques useful in removing contamination from the jets of 
boosted heavy particles keep a fixed number A'cut of hard subjets. To enable an apples-to- 
apples comparison of /cut vs. A'cuti we will simply take the jet trimming algorithm above, 
replacing step 3 with: 

3. Sort the subjets according to pT and discard the contributions of those softer than 
the A'^cut-th hardest subjet. 

This allows us to employ a condensed notation in discussing various trimming procedures. 
We can denote different algorithms by 

alg({/,iV},{pT,^}), (3.5) 

where "alg" is the algorithm used to make the seed jets (here anti-/cr or VR), {/, A} 
specifies whether we will trim all subjets with a px below /cut ■ Ahard or include only the 
Acut hardest subjets, and the final entry (only relevant for /cut algorithms) species whether 
we will use the px of the seed jet or the effective mass of the event to set Ahard- 

In addition to this fixed Acut algorithm — which we believe represents the most advan- 
tageous application of previous techniques for boosted objects to the study of light parton 
jets — we will also include a direct implementation of an algorithm from an earlier study. 
We will present results using the jet filtering technique of Ref. [7] (labelled Filtering below) 
which takes a jet and only includes those constituents that fit into the two hardest C/A 
subjets formed from cones of size Rq/2. 

We were also interested in testing the jet pruning procedure of Ref. [11], since it could 
be considered as something of a middle ground between trimming and the cleaning methods 
that cut on a fixed number of subjets. ^"^ Pruning functions by attempting to remove 
spurious mergings in the clustering sequence: after a seed jet is formed, its constituents 
are reclustered using the kx or C/A algorithm, and if the jet algorithm attempts to merge 
widely separated (Ai?, > 2Rpmj /px) four-momenta with a large px hierarchy {z < 2;cut)^^ 
then the merging is rejected, and the softer of the two four-momenta deleted. Pruning is 
most effective at removing spurious mergings from the later stages of clustering (i.e. right 
before the jet is complete), which is precisely what it should do to reconstruct a boosted 
heavy particle. 

However, in our studies on pruning light quark jets we found at best only a marginal 
improvement in reconstruction. This occurred when the optimized value of i?p was essen- 
tially zero, meaning that the Zcut criteria was being applied at every stage of the recon- 
struction.^^ We suspect that the reason pruning is not working well in this context is that 

^^The implementation of pruning we tested was FastPrune 0.3.0 [28]. 
^■'Here z = min[pTa,PTb]/pTtot for four-momenta pa and pb with ptot = Pa +Pb- 

^^Because the improvements we see with pruning arc small, and occur at parameters at which the pro- 
cedure is uncomfortably sensitive to the calorimeter segmentation, we will not report them in the next 
section. 
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far enough down the hne in the parton shower, there is no longer a clear scale separation 
between FSR and contamination. So while pruning employs a relative pT cut (as in jet 
trimming), it appears to be most effective when employed on a jet with a fixed, small num- 
ber of hard subjets. It is an open question whether pruning techniques might be modified 
to successfully clean light quark jets. 



4. Results 

We now apply the above jet trimming procedure to two examples in different kinematic 
regimes: heavy resonance reconstruction, and a two-step decay chain. Our goal is justify 
the use of trimming, show that it is advantageous to use a trimming procedure specifically 
designed for jets from light partons, and to see how different measures of Ahard can change 
the reconstruction of the trimmed event. Unlike Sec. 2, here we will include both the effects 
of ISR/MI and event pileup. 

In both examples, we will find that employing any sort of trimming procedure leads 
to an improvement in reconstruction. However, in going from an algorithm designed for 
boosted heavy particles to one specifically aimed at light parton jets, we can realize signif- 
icant additional gains. Further, using a measure of hardness well suited to the kinematics 
of an event can make almost as big a difference in reconstruction as to the decision to trim 
in the first place. 

Our results confirm our intuitions from Sec. 2 that trimming partially resolves the 
jet-size/contamination tradeoff. For the anti-^T algorithms, the optimal Rq value in the 
trimmed sample is systematically larger than the optimal Rq value in the untrimmed 
sample. Similar conclusions hold for VR, with the jet size parameter p being larger in 
the trimmed samples. We will find that background dijet distributions are not increased 
through the use of a large initial radius, and may even be reduced in some cases. Finally, 
as expected, the active jet area [4] is substantially smaller in the trimmed sample. 

To quantify reconstruction performance, we will fit reconstructed invariant mass dis- 
tributions to a sum of two distributions (similar to what was done in Ref. [11]): 



S{m) = a 



1 + /3(m - M) 



(4.1) 



(m2-M2)2 + r2M2_ 
B{m) = 5 + -i/m, (4.2) 

where 5 and 7 are restricted to be > 0. Here S{m) is a skewed Breit-Wigner distribution 
and B{m) is a background-like falling distribution. We quantify signal reconstruction via 
the measure 

A-^W = f2^' (4-3) 

i.e. the peak height of the S{m) curve. While other measures of reconstruction perfor- 
mance would be equally reasonable, this measure favors algorithms reconstructing a tall 
S{m) of narrow width, and has the advantage of not introducing any arbitrary parameters 



^®In VR algorithms, the radius of a jet is approximately R ~ p/pt, where pr is the jet's transverse 
momentum. 
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Table 2: Comparison of dijet resonance reconstruction using trimmed and untrimmed algorithms. 
The first column specifies the algorithm, the second lists the change in A over untrimmed anti-fc-r 
(second row), the third lists the relevant trimming parameters, the fourth contains the subjet radius, 
the fifth the seed jet parameters, the sixth the fitted width, and the seventh the fitted mass. For 
each algorithm, we have optimized those parameters denoted by a *, while the rest have remained 
fixed. 

beyond the fitted functional form. Note that this reconstruction measure does not attempt 
to reward algorithms that get the right peak position, and we will see a corresponding 
systematic invariant mass shift in using trimmed jets. 

For simplicity of discussion, we only consider processes with initial/final state gluons. 
From Table 1, we see that improvements are certainly possible when these are replaced 
with light quarks, and all of our conclusions regarding the optimal trimming method will 
hold there as well. It is important to remember, though, that quarks have a lower effective 
color charge then gluons and thus produce less QCD radiation. Thus, for light quarks one 
expects (and we found) a diminished optimal untrimmed jet radius and a lower potential 
improvement achievable through trimming. 

Finally, one should keep in mind that while the improvements we find are the result of 
well understood physical effects, the precise values of the trimming parameters will change 
somewhat when the Monte Carlo tuning is adjusted to account for LHC data. Thus, while 
the parameters below will provide a reasonable guide to what should be used at the LHC, 
the exact values will need to be inferred from a iterative process of Monte Carlo tuning to 
standard candles. 

4.1 Heavy Resonance Decays 

The simplest test of a jet algorithm is how it reconstructs a heavy resonance decaying to 
the two jets. As in Sec. 2, we use the process gg ^ 4> ^ gg where 4> \s a, color octet scalar 
with = 500 GeV. 

The results of this reconstruction are presented in Table 2. Here we are interested 
primarily in two different comparisons: untrimmed algorithms versus those trimmed using 
an /cut (so as to measure the full potential for improvement in reconstruction), and those 
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Mass [GeV] Mass [GeV] 

Figure 7: Dijet resonance reconstruction with and without trimming using the anti-fcr/VR and 
anti-ZcT/VR (/, H) algorithms. The algorithm parameters are those that optimize the A measure 
of Eq. (4.3), as listed in Table 2. The upper curves are fitted to the sum of S{m) and B{m) from 
Eqs. (4.1) and (4.2), while the lower curves display the contribution of B{m). 

trimmed using an A'cut to those using an /cut- Now, the more parameter choices one 
optimizes in an algorithm the more that algorithm stands to gain from arbitrary statistical 
fluctuations. To guard against this and ensure that the first comparison above is fair, we 
fully optimize the anti-Zcy {N) algorithm, using the resulting best choices of i?sub and Rq as 
inputs to our optimization of anti-A;^" (/), for which we only optimize a single parameter: 
/cut- The result is a fair comparison of untrimmed algorithms to those trimmed with an 
/cut? and a comparison of A'cut to /cut trimming where A'cut trimming is given a statistical 
advantage. 

Several algorithms and trimming procedures are presented in Table 2. We have in- 
cluded untrimmed anti-ZcT) anti-Zcr with a cut on the momenta of subjets (set relative to 
both the jet's px and the event's effective mass), anti-fcr with a fixed number of kx subjets, 
and for comparison with previous techniques anti-ZcT with two C/A subjets of half the seed 
jet radius (i.e. the filtering procedure of Ref. [7]). Both trimmed and untrimmed VR jets 
are also included. In Fig. 7, we display the reconstructed (j) mass using both trimmed and 
untrimmed anti-/cT and VR algorithms. 

We see that trimming of any sort is useful in reconstruction. However, the difference 
between trimming techniques is apparent. By using an algorithm with a pT cut determined 
as a fraction of the original pT (i-e. the samples whose trimming is parameterized by an 
/cut) we are able to see significant gains beyond what is possible using a fixed number of 
subjets. This refiects the fact that the structure of the jet from a light parton is not known 
a-priori, unlike the jets from boosted heavy particles, so it is advantageous to trim with 
a direct subjet pT cut. We further note that at this stage, the difference between using 
H and px to set A^ard niakes only a small difference in reconstruction, reflecting the fact 

^^For the VR algorithms we will take the anti-fcr optimized Ro, /cut, and A'cut as inputs {Ra will set 
iimax) and optimize the p parameter. 
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Table 3: Comparison of the 2 — > 4 resonance reconstruction using trimmed and untrimmed algo- 
rithms. Reconstruction is performed by taking the invariant mass of the hardest four jets. The first 
column specifies the algorithm, the second lists the change in A over untrimmed anti-fcr (second 
row), the third lists the trimming parameters, the fourth contains the subjet radius, the fifth the 
seed jet parameters, the sixth the fitted width, and the seventh the fitted mass. For each algorithm 
we have optimized those parameters denoted by a * , while the rest have remained fixed. 

that for dijet events ~ i^/2. Below, we will see that the situation will change in more 
complicated event topologies. 

Before continuing, we remark that in Fig. 7, the dijet invariant mass distribution is 
systematically shifted to lower values through the effects of jet trimming. This is to be 
expected, given that the trimming procedure will necessarily result in some accidental 
removal of FSR. To understand the size of the effect, note that in Table 2 we find an 
optimized /cut of around 3% when we cut on the subjet 's relative to that of the seed 
jet, and that the optimal A'^cut for fixed-number cleaning is 5. Since the pattern of QCD 
radiation from a light parton ensures us that the sub jets follow a strong hierarchy, we 
should only expect one or two subjets to be slightly below the 3% cut we have imposed. 
This is enough to account for the roughly 5% shift in M that we observe. 

4.2 Longer Decay Chains 

Next, we consider the production channel gg ^ X ^ YY — t- gggg where mx = 1 TeV 
and my = 300 GeV. This sample is qualitatively different from the dijet reconstruction in 
two ways: the final state is more crowded, and the final state jets can vary widely in px 
within the same event. The results from this reconstruction are presented in Table 3, and 
the resulting mx and my distributions are plotted in Fig. 8 and Fig. 9, respectively. 

That the final state is crowded somewhat limits the improvements achievable from 
trimming. We saw before in Table 2 that trimming seemed to work well when the seed 
jets were allowed to grow much larger than the optimized untrimmed jets. Here, the 
untrimmed jets are optimized at Rq = 0.8, so the trimmed jets cannot grow much larger 
without merging with each other and ruining the reconstruction. Despite this limitation, 
however, we see that valuable improvements are still possible. 
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Figure 8: Reconstruction of the resonance mx = 1 TeV in gg X YY 9999 with 
and without trimming using the anti-fcr/VR and anti-fcr/VR (/, H) algorithms. The algorithm 
parameters are those optimized to those that optimize the A measure of Eq. (4.3), as listed in 
Table 3. The upper curves are fitted to the sum of S{m) and B{m) from Eqns. (4.1) and (4.2), 
while the lower curves represent the contribution of B{m). 




Figure 9: Reconstruction of the intermediate resonance mass my — 300 GeV in the process 
gg ^ X ^ YY — > gggg using the anti-fcr/VR and anti-fcr/VR i.f,H) parameters of Table 3. 
These distributions arc formed by taking the four hardest jets in each event, considering the two 
masses from every possible 2x2 partition of these jets, and plotting the masses from the most 
equitable partition (defined as the one for which TOmin/'TT-max is closest to one). 



More importantly, now we see that the choice of Ahard can make a significant difference 
in reconstruction. When Ahard is chosen to be the effective mass of the event, reconstruction 
is improved beyond the case where Ahard is the seed jet px (the improvement roughly 
doubles). This is because when we let the seed jet pj- determine the hard scale for each 
jet while using a fixed fcnt, the softer jets will see little trimming (because the minimum 
subjet pt is soft), while for the same reason the harder jets will see too much trimming. 
The resolution, it seems, is to simply use a global pT cutoff for each event when the signal 
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Figure 10: Standard model QCD dijet background reconstruction with and without trimming 
using the anti-fc-r/VR and anti-ZcT/VR (/, H) algorithms and the optimized signal parameters 
from Table 2. 

jets are of different characteristic pT scales. 

4.3 Dijet Backgrounds 

The improvements in signal reconstruction seen so far would be of little use if jet trimming 
significantly increased the background as well. After all, to see improvements in signal 
reconstruction we must let our seed jets cluster with a large radius, and it is possible that 
this could result in an unintended rise in the background distributions. 

Fortunately, this does not seem to be the case. In Fig. 10 we present the background 
QCD dijet invariant mass distributions clustered using the parameters of Table 2 optimized 
for signal reconstruction. If anything, we see that the trimmed distributions are shifted to 
lower invariant mass values than the untrimmed distributions. This is especially useful in 
the case of the VR jet algorithm, which on its own can distort background distributions 
to higher values. It is tempting to argue from this that trimming can also be useful 
in reducing the background, but one should be careful drawing such a conclusion as the 
signal position also shifts. The precise signal and background interplay, while intriguing, 
is therefore likely to be highly process dependent, and requires a dedicated study. 

4.4 Jet Area 

In Fig. 2, we argued that the overlap of ISR/MI with FSR was minimal, so even though 
the naive area of the jets employed in our analysis is quite large, there should not be 
significant sensitivity to the effects of ISR/MI/pileup that we set out to avoid. We can 
quantify this statement using the catchment area of a jet [4], allowing us to directly measure 
the sensitivity of the trimmed jet to uniform diffuse contamination. We find that while the 
jets we use in trimming start with large areas, after the jet trimming procedure is applied 

^*In Ref. [3] we had to impose a jet-quality cut to prevent this distortion. Witli jet trimming, it seems 
that such a cut is unnecessary. 



-17- 
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Figure 11: Jet area, defined as the area of calorimeter cells clustered into a jet if each cell contains 
at least an infinitesimal about of radiation, for anti-fcy vs. anti-fcT(/, H) (left panel) and VR vs. 
VR(/, H) (right panel), using the optimized parameters from Table 2. The area of the untrimmed 
algorithm is roughly 7ri?g, as expected, while the trimmed jet's area is much smaller. 

the active area decreases dramatically, as shown in Fig. 11. In fact, the active area after 
trimming is even less than that of the untrimmed jet which began with a smaller radius. 

One caution, however, is that the catchment area only captures the sensitivity to soft 
contamination. Trimming cannot guard against a fluctuation of ISR/MI/pileup that yields 
a hard subjet above the /cut threshold. In some ways, trimming accentuates such fluctu- 
ations, since the contamination cannot be averaged over a larger jet area and subtracted 
statistically using, e.g. the methods of Ref. [4]. Therefore, more detailed studies are needed 
to really understand such systematic biases. 

5. Conclusions 

In this paper, we have proposed jet trimming as a way to improve jet reconstruction by 
mitigating the spatial overlap between FSR and ISR/MI/pileup in hadronic collisions. This 
technique actively removes sources of contamination by exploiting the difference in scale 
between the hard emissions of FSR and the relatively soft emissions from ISR/MI/pileup. 
While prior efforts had been made along similar lines, those efforts focused on removing 
contamination from the jets of heavy boosted objects. We have shown that light parton jets 
benefit from methods that emphasize relative subjet px instead of the number of subjets. 

We presented an explicit algorithm that implements jet trimming. Our algorithm 
begins with seed jets constructed through any means (here we employ anti-feT and VR), 
which are then reclustered using an inclusive /ct algorithm and trimmed according to a 
subjet pt cut set relative to some hard scale determined by the kinematics of the event. 
In two different kinematic configurations, we find large improvements in reconstruction 
efficiency from using trimmed jets. Moreover, unlike our previous VR algorithm [3], this 
improvement was obtained without a drastic increase in the catchment area of the jet. 
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Further study is necessary to understand how this jet trimming procedure would affect 
jet systematic errors in an actual experimental context. For example, jet energy scale sys- 
tematics already require a correction from ISR/MI/pileup contamination, and exactly how 
a jet energy correction would be applied in the case of trimmed jets is unclear. However, by 
addressing ISR/MI/pileup contamination on a jet-by-jet basis, we expect that the system- 
atic uncertainty associated with trimmed jets should not be any worse than for fixed-radius 
jets. Moreover, it would be interesting to see whether the systematic shift in the invariant 
mass peak from accidentally throwing away FSR subjets could be fixed through a simple 
jet energy rescaling. 

Finally, while the improvement in reconstruction from trimming is already quite help- 
ful, it is nowhere near the in-principle improvement we saw in Sec. 2. Perhaps further 
advances can be made through a better choice of the Ahard parameter or a different subjet 
finding procedure. Whether any jet trimming algorithm can ever hope to approach the 
theoretical limit in ISR/MI/pileup rejection is an important open question, but the gains 
already seen in a simple trimming algorithm recommend its use at the LHC. 
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